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CN ! Abstract 

We present an 0(n^'^)-time algorithm for maintaining the topological order of a directed acyclic 
' graph with n vertices while inserting m edges. This is an improvement over the previous result of 
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A topological order T of a directed acyclic graph (DAG) G = (V, E) is a linear order of all its vertices 
' such that if G contains an edge {u,v), then T{u) < T{v). In this paper we study an online variant of 
the topological ordering problem in which the edges of the DAG are given one at a time and we have 
Q , to update the order T each time an edge is added. Its practical applications can be found in [2l[5l[7]. 
In this paper, we give an 0(n^'^)-time^ algorithm for online topological ordering. 

^ . 1.1 Related Work 



Alpern et al. [2\ gave an algorithm which takes 0(| |(5| | log 1 15| |) time for each edge insertion, where 
\\6\\ is a measure of the change. (For a formal definition of \\d\\, please see [2l[71[8].) Pearce and Kelly 
[7] proposed a different algorithm which needs slightly more time to process an edge insertion in the 
worst case than the algorithm given by Alpern et al. [_2j, but showed experimentally their algorithm 
perform well on sparse graphs. 

Marchetti-Spaccamela et al. [6j gave an algorithm which takes 0{mn) time for inserting m edges. 
Katriel [3] showed that the analysis is tight. Katriel and Bodlaender [3] modified the algorithm 
proposed by Alpern et al. [2], which is referred to as the Katriel-Bodlaender algorithm. Katriel and 
Bodlaender proved that their algorithm has both an 0(min{m^/^ log n, rr?^'^ + v? log n}) upper bound 
and an r2(m^/^) lower bound on runtime for m edge insertions. Katriel and Bodlaender also analyzed 
the complexity of their algorithm on structured graphs. They showed that the Katriel-Bodlaender 
algorithm runs in time 0(mA;log^ n) where k is the treewidth of the underlying undirected graph and 



^The symbol O means O with log factors ignored. Depending on the implementation, the runtime may vary from 
0(n2 '5 log2 n) to ©(n^ Mogn). 
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can be implemented to run in O(nlogn) time on trees. In [9], we proved that the Katriel-Bodlaender 
algorithm takes 0(m^/^ +mn^/^ log n) time for inserting m edges. Recently, Ajwani et al. [Tj proposed 
an 0(n^'^^)-time algorithm, independent of the number of edges m inserted. To the best of our 
knowledge, it is the best result for dense DAGs. 

2 Algorithm 

We keep the current topological order as a bijective function T : V ^ [l..n]. Let d{u,v) denote 
\T{v) — T{u)\, u ^ V express that there is an edge from uto u v express that there is a path from 
uiov and n < u be a short form of T{u) < T{v). Let n°'^ < to < *i < *2 < • • • < tp~i <tp< tp+i = 
where p = O(logn) is a nonnegative integer. In Section [6l we shall show how to determine the values 
of these parameters. 

Figure [1] gives the pseudo code of our algorithm. T is initialized with the topological order of 
the starting graph. Whenever an edge (u, v) is inserted into the graph, lNSERT(ti, v) is called. If 
u < V, then lNSERT(tt, v) does not change T and simply insert the edge into the graph. If u > v, then 
lNSERT(it, v) calls Reorder(w, u, 0, 0) to update T such that T is still a valid topological order and 
T{u) < T{v). After the call to Reorder(u, u, 0, 0), lNSERT(n, can safely insert the edge into the 
graph. 

It remains to explain how the procedure Reorder(u, /i, /2) works. The duty of the procedure 
REORDER(n, f , /i, /2) is to update T such that T is still a valid topological order and T{u) > T{v). 
The flag /i = 1 indicates that the set A' = {w : u ^ w and w < v} has been known to be empty. 
The flag /2 = 1 indicates that the set B' = {w : w ^ v and w > u} has been known to be empty. If 
T{u) > T{v), then we directly exit. Otherwise, there are two cases to consider: 

1: ti < d{u,v) < tj+i for some i = 0, . . . ,p. In this case, we first have to compute Ai = {w : u — > 
w, d{u,w) < ti, and w < v} and Bi = {w : w ^ v, d{w,v) < ti, and w > u}. If = and 
/i = 0, then we still have to compute ^i+i = {w : u ^ w, d{u,w) < tj+i, and w < v} and set 
A = Ai^i; otherwise, we directly set A = Ai. Similarly, if = and /2 = 0, then we still have 
to compute -Bi+i = {w : u ^ w, d{u,w) < tj+i, and w < v} and set B = Bi^i; otherwise, we 
directly set B = Bi. 

2: d{u,v) < to- In this case we directly set A = Aq = {w : u ^ w, d{u,w) < Iq, and w < v} and 
B = Bq = {w : w ^ V, d{w, v) < to, and w > u}. 

If both A and B are empty, then we directly swap u and v and exit the procedure. Otherwise, let 
Torigiani be the topological order at the start of the execution of the procedure. For each u' £ {u} U A, 
considered in order of decreasing Toriginai{u'), we do the following. For each v' £ {v' : v' G B U 
{v} and Toriginaiiv') > Torginai{u')}, Considered in order of increasing ToriginaiW) , recursively call 
REORDER(n', ti', /{, /g). The first flag /{ is set to 1 if and only \i u' = u and A = and the second 
flag /2 is set to 1 if and only \i v' = v and i? = 0. 

The idea behind the algorithm. Our algorithm broadly follows the algorithm by Ajwani et al. [1]. 
The main difference is that Ajwani et al. always set A to ^j+i and B to -Bi+i during the execution 
of Reorder but we set A to ^j+i only if = and B to Bi+i only if Bi = 0. We shall prove that 
the total number of calls to Reorder won't increase (bounded above by 0(n^)) by introducing this 
modification. Thus intuitively, our algorithm should run faster because in each call to Reorder we 
might only need to compute Ai and Bi instead of Aj+i and 
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lNSERT(tt, v) 

{* Insert edge {u, v) and calculate new topological ordering *) 

1 if V < u then Reorder(v, u, 0, 0) 

2 insert edge {u, v) in graph 

REORDER(tl, V, fi, /2) 

(* Reorder vertices between u and v such that v < u *) 



1 if 


V < u then exit 




2 if < d{u, v) < ij+i 




3 


then 




4 


Ai := {w : u — ^ w, d{u, w) < U, and w < v} 




5 


Bi := {w : w ^ v, d{w, v) < ti, and w > u} 




6 


A := Ai if 7^ or /i = 1; otherwise, A := Aj+i 


:= {w : u ■ 


7 


B := if / or /a = 1; otherwise, 5 := Sj+i 


:= {w : w 


8 


else 




9 


^ := := {tu : « — ^ w, d{u, w) < to, and w < v} 




10 


B := Bq := {w : w ^ v, d{w,v) < to, and w > u} 




11 if yl = and 5 = 




12 


then 




13 


swap u and v 




14 


else 




15 


for u' G {n} U yl in decreasing topological order 




16 


for v' £ B U {v} f\v' > u' in increasing topological order 


17 


/( := 1 if (n = u' and A = 0); otherwise, /( : 


:= 


18 


/2 := 1 if (i' = f' and S = 0); otherwise, /2 : 


= 


19 


Reorder(«',v',/(,/^) 





w, d{u,w) < tj+i, and w < v} 
> u, d{w,v) < tj+i, and u; > 



Figure 1: Our algorithm. 

3 Data Structures 
3.1 Main Data Structures 

In the following, we shall describe the main data structures used in our algorithm. 

The current topological order T and its inverse are stored as arrays. Thus finding T(i) and 
T~^{u) can be done in constant time. 

The DAG G = {V, E) is stored as an array of vertices. For each vertex u we maintain two adjacency 
lists InList{u) and OutList{u). The backward adjacency list InList[u] contains all vertices v with 
{v, u) G E. The forward adjacency list OutList{u) contains all vertices v with (u, v) € E. Adjacency 
lists are implemented by using n-bit arrays and support the following operations. 

1. List-Insert: Given a vertex and a list, add the vertex to the list. 

2. List-Search: Given a vertex and a list, determine if the vertex is in the list. If yes, return 1. 
Else, return 0. 

Since the adjacency lists are implemented by using n-bit arrays, it takes 0(1) time per List-Insert 
or List-Search operation. 
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3.2 Auxiliciry Data Structures 

In the following we describe some auxiliary data structures which are used in our algorithm to improve 

the time complexity. For each vertex n, we maintain two arrays of pails: InPails{u)[0 ■ ■ ■ p + 1] and 
OutPails{u)[0 ■ ■ ■ p + 1]. InPails{u)[i] contains all vertices v with < d{v,u) < ti and {v,u) £ E. 
OutPails{u)[i] contains all vertices v with < d(u,v) < ti and (u,v) G E. A vertex v m a pail is 
stored with its vertex index (and not T{v)) as its key. Pails are implemented by using balanced binary 
search trees and support the following operations. 

1. Pail-Insert: Given a vertex and a pail, add the vertex to the pail. 

2. Pail-Delete: Given a vertex and a pail, delete the vertex from the pail. 

3. Pail-Collect- All: Given a pail, report all vertices in the pail. 

It takes O(logn) time per Pail-Insert or Pail-Delete and 0(1 + 7) time per Pail-Collect- 
All, where 7 is the number of vertices in the pail. 

3.3 Instructions for Data Structures 

Given a DAG G with a valid topological order and two vertices u and v with u -/^ v, define sorted 
vertex sets Ai and Bi, i = 0, . . . ,p + 1, as follows: 

Ai = {w : u ^ w and d{u, w) < ti and w < v} sorted by the topological order. 
Bi = {w : w ^ V and d{u, w) < ti and w > u} sorted by the topological order. 

In the following we discuss how to insert an edge, compute vertex sets Ai, and Bi, and swap two 
vertices in terms of the above five basic operations. 

a. Inserting an edge {u,v): This means inserting vertex v to the forward adjacency list of u and 
u to the backward adjacency list of v. This requires two List-Insert operations and at most 
2(p + 2) Pail-Insert operations. Thus inserting an edge {u, v) can be done in 0{plogn) = 
time. 

b. Computing A^ and Bf. Ai can be computed by sorting the vertices in OutPail{u)[i] and choosing 
all w with w < V. This can be done by first calling Pail-Collect-All to collect all the vertices 
in OutPail{u)[i\ in 0{ti) time. Note that for all these vertices w, we have < T{w) — T{u) < ti. 
Thus we can sort these vertices in 0{ti) time by counting sort and then choose all w with w < v 
in 0(|^i| -I- 1) time. The total time required to compute Ai is 0{ti + \Ai\) = 0{ti). Similarly, 
the time required to compute Bi is 0{ti). 

c. Computing Ai and Bi when ti < d{u,v) < ti+i: Ai can be computed by sorting the vertices in 
OutPail{u)[i]. This can be done by first calling Pail-Collect-All to collect all the vertices in 
OutPail{u)[i] in 0(|Aj|-|-l) time, and then sorting these vertices in 0((|v4j|-|-l) log n) time. Thus 
the total time required to compute Ai is -|- 1). Similarly, the time required to compute 
Bi isO(|B,| + l). 

d. Swapping u and v: Without loss of generality assume u < v. When swapping u and v, we need 
to update the pails, T, and T~^. We now show how to update the pails. For all vertices w with 
T{u) — ti < T{w) < mm{T{u),T{u) — ti + d{u,v)}, we delete w from InPail{u)[i] and delete u 
from OutPail{w)[i]. For all vertices w with max{T{v) +ti — d(u, v),T{v)} < T{w) < T{v) + ti}, 
we delete w from OutPail(v)[i\ and delete v from InPail{w)[i]. It requires total 0{d(u, v)) Pail- 
Delete operations for each i. For all w with max{T{v),T{u) +ti} < T{w) < T{u)+ti + d{u, v), 
if w is in the forward adjacency list of u, then insert w into OutPail(u)[i] and insert u into 
InPail{w)\i]. For all w with T{v)-ti- d{u,v) <T{w) < min{T{u),T{v) -ti + d{u,v)}, if -u; is in 
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the backward adjacency list of v, then insert w into InPail{v)[i] and insert v into OutPail{w)[i]. 
It requires total 0{d{u,v)) List-Search and Pail-Insert operations for each i. In total, we 
need 0{p ■ d{u,v)) List-Search, Pail-Insert, and Pail-Delete operations. Updating T and 
is trivial and can be done in constant time. Thus the total time is 0{p ■ d{u,v)logn) = 
d{d{u,v)). 

4 Correctness 

In this section, we shall argue that our algorithm is correct. We say a call of a recursive procedure 
leads to an operation "by itself" if and only if this operation is executed during the execution of 
this call and not during the execution of subsequent recursive calls. Given a DAG G with a valid 
topological order T and two vertices u,v of G with u < v, let A' = {w : u ^ w and w < v} and 
B' = {w : w ^ V and w > u}. We say the flag /i of the call to REORDER(n, v, /i, /2) is correctly set 
only if (/i =^ {A' = 0)) = 1. That is, if / = 1, then A' is empty. We say the flag /2 of the cah to 
REORDER(n, V, fi, f2) is correctly set only if /2 {B' = 0) = 1. That is, if /2 = 1, then B' is empty. 

Lemma 1: Given a DAG G with a valid topological order and two vertices u,v of G with u < v, let 
A' = {w : u ^ w and w < and B' = {w : w ^ v and w > u}. If the flags are correctly set, then 
in the call of Reorder('u, f , /i, /2), A := if and only if A' = 0. Similarly, if the flags are correctly 
set, then in the call of REORDER(n, v, /i, /2), i? := if and only if B' = 0. 

Proof: We shall only prove that A := if and only if A' = 0. It can be proved in a similar way that 

5 := if and only if i?' = 0. There are two cases to consider. 

Case 1: ti < d{u, v) < tj+i for some i with < i < p. By the algorithm, A := if and only if 

Ai = {w : u ^ w and w) < ti and w < v} = ^ and 

= {w : u ^ w and d{u, w) < tj+i and u; < f } = V /i = 1). 

By ti < d{u,v) < tj+i, we have Ai C Aj+i = A'. From Ai C Aj+i = A' and (/i {A' = 0)) = 1, we 
conclude that ^ := if and only if A' = 0. 

Case 2: d{u,v) < tQ. By the algorithm, A := if and only if 

Aq = {w : u ^ w and d{u, w) < to and w < v} = ^. 

By d(u, v) < to, we have = From Aq = A' , we conclude that A := if and only if A' = (/>. □ 

Lemma 2: Given a DAG G with a valid topological order and two vertices u,v of G with u < v, let 
A' = {w : u ^ w and w < v} and B' = {w : w ^ v and w > u}. If the flags are correctly set, then 
Reorder(m, V, /i, /2) leads to a swap by itself if and only if ^' = and B' = 0. 

Proof: By the algorithm, the call to Reorder(u, /i, /2) leads to a swap by itself if and only if in 
this call A := and B := 0. By Lemma [H A := and 5 := if and only if A' = and B' = %. □ 

Given a DAG G with a valid topological order, REORDER(ii, fi, /2) is said to be local if and only 
if the execution of REORDER(ti, v) will not affect T{w) for all w with w > v or w < u. 

Lemma 3: Given a DAG G with a valid topological order and two vertices u,v with u -/^ v, if the 
flags are correctly set, then REORDER(n, /i, /2) maintains a valid topological order and stop with 
V < u and is local. 
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Proof: We prove the lemma by induction on T{v) — T{u). When T[u) — T[v) < 0, the lemma is 
trivially correct. 

Assume the lemma to be true when T{v) — T{u) < k, where A; > 0. We shall prove that the lemma is 
true when T{v) — T{u) = k. \i A' = {w : u ^ w and w < v} = ^ and B' = {w : w — > v and w > u} = 
0, then by Lemma El line 13 is executed. Thus, REORDER(n, /i, /2) maintains a valid topological 
order, stops with v < u, and only T(n) and T{v) are updated, so the lemma follows. If ^' 7^ or 
B' 7^ 0, by Lemma O the for-loops are executed. Let T' be the initial topological order. By our 
induction hypothesis and Lemma [H the following loop invariants hold: 

1. T is a valid topological order. 

2. At the start of the execution of line 19, T{v') - T{u') < k and T'(n) < T{u') < T{v') < T'{v). 

3. At the start of the execution of line 19, u' -/> v' . 

4. The flags are correctly set for the recursive call. 

By the loop invariants and our induction hypothesis, each recursive call REORDER(n', r;', /{, /j) in 
the for-loops stops with v' < u' and is local. Since the last recursive call is REORDER(n, u), the 
entire procedure stops with v < u. Since each recursive call REORDER(ti', w') is local and starts 
with T'{u) < T{u') < T(v') < T'{v), the topological order of vertices w with T'{w) > T'{v) or 
T'[w) < T'{u) is not affected. Thus the entire procedure maintains a valid topological order, stops 
with V < u, and is local. □ 

Theorem 1: Given a DAG G with a valid topological order and two vertices u,v of G with u -/^ v, 
if the flags are correctly set, then lNSERT(n, will add an edge {u,v) to G and maintain a valid 
topological order. 

Proof: Because u -/^ v, we know that u and v are two different vertices and either u < v 01 u > v. 
If n < w then the theorem is trivially correct. Assume that v > u. By Lemma El Reorder(?;, u, 0, 0) 
will stop with u < V and maintain a valid topological order. Thus when line 2 of Insert is ready to 
be executed, we will have a valid topological order and u < v, and adding an edge (u, v) to G won't 
affect the validness of the topological order. □ 

In addition to the correctness of the algorithm, we also want to prove that the flags are always 
correctly set. 

Lemma 4: Given a DAG G with a valid topological order and two vertices u,v of G with u < v, 
consider a call to REORDER(ii, f , /i, 72). If the flags /i and /2 are correctly set, then while executing 
this call, all subsequent calls to Reorder will also own correct flags. 

Proof: We prove the lemma by induction on the depth of the recursion tree. By Lemma [3l the call to 
Reorder(m, /i, 72) will stop, so the depth of the recursion tree is finite. If the depth is zero, then 
no recursive calls are made and the lemma follows. 

Assume the lemma to be true when the depth of the recursion tree is less than k, where A; > 0. 
We shall prove that the lemma is true when the depth of the recursion tree is k. Since A; > 0, there is 
at least one recursive call. Thus the for-loops are executed. By Lemma[T]and LemmaEl the following 
loop invariants hold: 



1, 
2, 
3 
4 



T is a valid topological order. 

At the start of the execution of line 19, T{u') < T(v'). 
At the start of the execution of line 19, u' -/^ v' . 
The flags are correctly set for the recursive call. 
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By the loop invariants and our induction hypothesis, each recursive call to Reorder('u', v', /(, /g) in 
the for-loops, together with all subsequent calls to Reorder in it, own correct flags, and the lemma 



Theorem 2: Given a DAG G with a valid topological order and two vertices u,v oi G with u •/> f , 
while executing lNSERT(n, the flags are correctly set for all calls to Reorder. 

Proof: If n < f then there are no calls to Reorder made while executing lNSERT(ii, t;), and the 
lemma follows. If u > v then lNSERT(ii, w) will call Reorder(?; , u, 0, 0). By Lemma [H the call to 
Reorder(?;, ti, 0, 0), together with all subsequent calls to Reorder in it, own correct flags, and the 
lemma follows. □ 



5 Runtime 

In this section, we analyze the time required to insert a sequence of edges. By Theorem [21 the flags 
are always correctly set. To avoid unnecessary discussion, each lemma, theorem, corollary, and proof 
in this section is state under the assumption that the flags are correctly set. To avoid notational 
overload, sometimes we shall just write REORDER(n, v) and ignore the flags. 

5.1 Properties 

Lemma 5: Given a DAG G with a valid topological order and two vertices u and v with u < v, then 
during the execution of REORDER(n, t'), we have that (1) T{x) is nondecreasing if n x; (2) T(y) is 
nonincreasing ii y v; and (3) T{z) doesn't change ii u -/^ z and z -/^ v. 

Proof: We prove the lemma by induction on the depth of the recursion tree. By Lemma [3l the call 
to REORDER(ti, v) will stop, so the depth of the recursion tree is finite. If the depth is zero, then no 
recursive calls are made. It follows that line 13 is executed, so the lemma follows. 

Assume the lemma to be true when the depth of the recursion tree is less than k, where A; > 0. 
We shall prove that the lemma is true when the depth of the recursion tree is k. Since k > 0, there is 
at least one recursive call. Thus the for-loops are executed. By Lemma[T]and LemmaO the following 
loop invariants hold: 



Note that for each recursive call Reorder(u', v') in the for-loops, we have u u' and v' v. Let 
u X. Then we have either u' x or (u' /> x and x •/> v'). By the induction hypothesis, T(x) is 
nondecreasing or doesn't change during the execution of the recursive call. Thus T{x) is nondecreas- 
ing if X. Let y V. Then we have either y v' or {u' y and y -/^ v'). By the induction 
hypothesis, T{y) is nonincreasing or doesn't change during the execution of the recursive call. Thus 
T(y) is nonincreasing if y v. Let u -/^ z and z v. Then n' /> z and z -/^ v' . By the induction 
hypothesis, T{z) doesn't change during the execution of the recursive call. Thus T{z) doesn't change 
if u -/^ z and z -/^ v. □ 



follows. 



□ 



1 

2 
3 
4 



T is a valid topological order. 

At the start of the execution of line 19, T{u') < T{v'). 
At the start of the execution of line 19, u' -/^ v' . 
The flags are correctly set for the recursive call. 
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Lemma 6: Given a DAG G with a valid topological order and two vertices u and v with u < v, for 
all X and y, REORDER(ti, v) leads to at most one swap of x and y. 

Proof: Suppose that REORDER(ii, v) leads to at least one swap of x and y. Without loss of generality 
we assume that x < y before the the first swap occurs. Then the first swap of x and y leads to increase 
of T{x) and decrease of T[y). Thus by Lemma [5l T{x) is nondecreasing and T{y) is nonincreasing 
during the execution of Reorder(w, v). After the first swap, we have x > y. Since T{x) is nonde- 
creasing and T{y) is nonincreasing, we know there are no more swaps. □ 

Theorem 3: While inserting a sequence of edges, for all vertices x and y, after the first swap of x 
and y, the relative order of x and y won't change. 

Proof: Suppose that the vertex pair (x, y) is swapped at least once while inserting a sequence of 
edges. By LemmaOand the algorithm Insert, each edge insertion leads to at most one swap of x and 
y. Let (u, v) be the first edge whose insertion leads to a swap of x and y. Without loss of generality we 
assume that x < y before the the first swap occurs. We shall prove that x > y will hold after the first 
swap of X and y. Consider the execution process of Insert(«, u). The first swap occurs during the 
execution of Reorder(w, u). By Lemma [5l we have v x and y u. Since T{x) is nondecreasing 
and T(y) is nonincreasing, after the first swap of x and y, x > y will hold until Reorder(?;, u) returns. 
After REORDER(f , u) returns, the edge (n, v) will be added to the graph. Thus we will have y x and 
X > y just after Insert(u, u) returns. By Lemma [3l calls to Reorder maintain a valid topological 
order, so x > y will hold hereafter. □ 

Corollary 1: While inserting a sequence of edges, for all vertices x and y, there is at most one swap 
of x and y. 

Lemma 7: Given a DAG G with a valid topological order and two vertices u and v with u < v, 
REORDER(n, v) leads to a swap of u and v. 

Proof: We prove the lemma by induction on the depth of the recursion tree. By Lemma [3l the call 
to REORDER(ti, v) will stop, so the depth of the recursion tree is finite. If the depth is zero, then no 
recursive calls are made. It follows that line 13 is executed, so the lemma follows. 

Assume the lemma to be true when the depth of the recursion tree is less than k, where A; > 0. 
We shall prove that the lemma is true when the depth of the recursion tree is k. Since k > 0, there is 
at least one recursive call. Thus the for-loops are executed. By Lemma[T]and Lemma[2l the following 
loop invariants hold: 



Note that when executing the last recursive call Reorder(m', v') in the for-loops, we have u' = u and 
v' = V. By our induction hypothesis and the loop invariants, the last recursive call leads to a swap of 



1, 
2, 
3, 
4, 



T is a valid topological order. 

At the start of the execution of line 19, T{u') < T{v'). 
At the start of the execution of line 19, u' -/^ v' . 
The flags are correctly set for the recursive call. 



u and V, and the lemma follows. 



□ 



Online Topological Ordering 



9 



Theorem 4: While inserting a sequence of edges, the summation of + over all calls of Reorder 



Proof: Consider arbitrary vertices u and v. We shall prove that v G B occurs at most once over all 
calls of REORDER(tt, •). This proves that the summation of \B\ over all calls of Reorder(?x, •) is less 
than or equal to n. Therefore the summation of \B\ over all calls of Reorder(-,-) is less than or 



Consider the execution process of the first call of Reorder(u, •) for which v G B. By the algorithm, 
a recursive call to REORDER(ti, v) is made in the for-loops. Before the recursive call to Reorder(u, v) 
in the for-loops, at the start of the execution of each recursive call to REORDER(n', in the for- 
loops, we have u < v and {u < u' < v' or u = u' < v' < v). This follows from the order in which 
we make the recursive calls and the local property (Lemma [3]). Since u < v and {u < u' < v' 
or u = u' < v' < "0), by the local property, u < v will hold during the execution of the call to 
REORDER(n', f'). Thus before the recursive call to Reorder(u, -0) in the for-loops, all recursive calls 
to REORDER(n', v') in the for-loops won't lead to a call to REORDER(n, •) for which v € B; otherwise, 
the call to REORDER(ii, •) for which v £ B will lead to a call to REORDER(n, v) which will further lead 
to V > u hj Lemma [3l leading to a contradiction. Suppose for the contradiction that the recursive 
call to REORDER(ti, -0) in the for-loops leads to a call to REORDER(ii, u") for which v E B. By the 
order in which we make the recursive calls and the local property, at the start of the execution of the 
recursive call to REORDER(n, ?)) in the for-loops, we have u < v. Since v ~^ v" , at the start of the 
execution of the recursive call to REORDER(n, in the for-loops, we have u < v < v" . Thus by the 
local property, v" > u will hold during the execution of the recursive call to REORDER(ti, v) in the 
for-loops. However, by LemmaO REORDER(ti, u ") stops with v" < u, which is a contradiction. After 
the recursive call to REORDER(ti, in the for-loops, we have v < uhy Lemma El Since v > u before 
the recursive call to REORDER(it, -0) in the for-loops, by Lemma [71 this recursive call leads to a swap 
of u and v. Thus after the recursive call to REORDER(ti, v) in the for-loops, we will have v < u and, 
by Lemma |3l the relative order of u and v won't change hereafter. Since v < u holds hereafter, there 
will be no more calls of REORDER(n, •) for which v £ B. Putting all things together, it follows that 
V G B occurs at most once over all calls of REORDER(n, •). 

Similarly, we can prove that for arbitrary vertices u and v, u € A occurs at most once over all calls 
of Reorder(-, ?;). It follows that the summation of |^| over all calls of Reorder(-, •) is less than or 
equal to n^. □ 

Corollary 2: While inserting a sequence of edges. Reorder is called O(n^) times. 

Proof: By the algorithm, a call to REORDER for which |^| -|- = leads to a swap by itself. By 
Corollary [H there are at most n? swaps. Thus there are at most calls to Reorder for which 
1^1 -I- \B\ = 0. By Theorem m there are O(n^) calls to Reorder for which -|- \B\ > 0. Therefore, 
there are 0{n'^) calls to Reorder in total. □ 

Let S = {{u,v) : there is a swap of u and v such that u < v while inserting the edges}. Define 



Since by Corollary [H each vertex pair is swapped at most once, D{u,v) is well defined. Let A; be a 
number with 1 < k < n. Define 



is 0(n2). 



equal to n^. 




V(u, v) G S; 
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The following theorem is the key to our runtime analysis. 

Theorem 5: For all k G [l,n] with k > n^'^, we have ^k{u,v) = 0{'n? ■ Vk). 

Proof: Let k = rf . Let T* denote the final topological order. Define x{T* {u),T* (v)) = D{u,v) and 
z{T* {u),T* (v)) = Dk{u,v) for all vertices u and v. The following linear inequalities are proved to be 
true by Ajwani et al. ^J. 

(1) x{i,j) < for all 1 < i < n and 1 < j < i. 

(2) x{i,j) < n for all 1 < i < n and i < j < n. 

(3) Y^j>iX{i,j) - Y.j<iX{3,i) < n for all 1 < « < n. 

It is easy to derive the following linear inequalities from the definitions of x{i,j) and z{i,j). 

(4) z{i,j) < rf for all 1 < i, j < n. 

(5) z{i,j) < x{i,j) for all 1 < i, j < n. 

(6) < z{i,j) for all 1 < i < n and 1 < j < n. 

(7) < x{i,j) for all 1 < i < n and 1 < j < n. 

We aim to estimate an upper bound on the objective values of the following linear program. 

max 2:(i,j) such that 

l<«,j<n 

(1) x{i,j) < for all 1 < i < n and 1 < j <i 

(2) x{i, j) < n for all 1 < i < n and i < j < n 

(3) Ylj>iX{i,j) - 'Ej<iX{j,i) < n for all 1 < i < n 

(4) z{i,j) < rf for all 1 < i, j < n 

(5) z{i,j) < x{i,j) for all 1 < i, j < n 

(6) < z{i,j) for all 1 < i < n and 1 < j < n 

(7) < x{i,j) for all 1 < « < n and 1 < j < n. 

In order to prove the upper bound on the objective values of the above linear program, we consider 
its dual problem. 



mm 



0<i<j<n 0<i<n 0<ij<n 



such that 



(1) Yi.n+j - Wi.n+j > for all < f < n and < i < i 

(2) Yi-n+j — Wi.n+j + Yn2j^i — Yn2^j > for all < i < n and j > i 

(3) Zi.n+j + Wi.n+j > 1 for all < i < n and < j < n 

(4) > for all < i < + n 

(5) Zi>0 for ah < i < 

(6) Wi>0 for ah < i < n^. 

Let c be a large enough constant, e.g. 120, such that (i + c • rfl'^Yl'^ > (i'^/^ + 1) for any 1 < i < n. 
The following is a feasible solution to the dual problem. 
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= 1 


for 


all 
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and 


< i < i 


^i-n+j 
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for 


all 
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and 


< i < i 


Wi.n+j 


= 1 


for 


all 
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i 
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n 


and 


< i < i 


^i-n+j - 
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for 


all 
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i 
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and 


i < j < i + c ■ n^l"^ 


^i-n+j 


= 1 


for 


all 





< 
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n 


and 


i < j < i + c ■ n^^'^ 


Wi.n+j 


= 


for 


all 
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n 


and 


i < j < i + c ■ n'"/'^ 


"^i-n+j - 
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for 


all 





< 


i 


< 


n 


and 


i + c - rCl"^ < j < n 


^i-n+j 


= 


for 


all 





< 


i 


< 


n 


and 


i + c - rCl"^ < j < n 


Wi.n+j 


= 1 


for 


all 





< 


i 


< 


n 


and 


i + c - n*"/^ < j < n 


Y -> ■ - 
^ n^+i 


= (n - iy/^ 


for 


all 





< 


i 


< 


n 







This feasible solution to the dual problem has an objective value of 0{n X^ILi ^'^'^'^ + n • n*" • c • n'"/^) = 
©(n^"*"^/^ + 71-^"'"'"^''/^) = 0(n^^^/^) = 0{n^ ■ \fk), which by the primal-dual theorem is an upper bound 
on the objective values of the original linear program. □ 

Lemma 8: Given a DAG G with a valid topological order and two vertices u and v with u < v, 
consider a call to REORDER(n, v). If A' = {w : u ^ w and w < v} = f}>, then when executing this call, 
we shall have the first flag /i = 1 for all subsequent calls to REORDER(ti, •). 

Proof: We prove the lemma by induction on the depth of the recursion tree. By Lemma El the call 
to Reorder(u, v) will stop, so the depth of the recursion tree is finite. If the depth is zero, then no 
subsequent recursive calls are made, so the lemma follows. 

Assume the lemma to be true when the depth of the recursion tree is less than k, where fc > 0. 
We shall prove that the lemma is true when the depth of the recursion tree is k. Since A; > 0, there 
is at least one recursive call. Thus the for-loops are executed. By Lemma [H A = 0. By the local 
property (Lemma [2|) and the order in which we make the recursive calls, any subsequent call to Re- 
ORDER(m, •) must occur during the execution of last iteration of the outer for-loop. Consider any flrst 
level recursive call Reorder(u', w', /(, /g) in the last iteration of the outer for-loop. Note that we 
have u' = u < v' and {w : u ^ w and w < v' < v} A' = 9 when this call is ready to be executed. 
Since u' = u and ^ = 0, we also have /{ = 1. By the induction hypothesis, we also have the first flag 
/i = 1 for all subsequent calls to Reorder(u, •) while executing this call. It completes the proof. □ 

Lemma 9: Given a DAG G with a valid topological order and two vertices u and v with u < v, 
let A' = {w : u ^ w and w < v} = 9. Then a call to REORDER(ii, v) will stop with u at the initial 
position of v. That is, letting Ti,efore be the topological order just before the call to REORDER(n, u), 
then the call to REORDER(it, t;) will return a topological order Tafter such that Tafter{u) = Tbefore{v)- 

Proof: We prove the lemma by induction on the depth of the recursion tree. By Lemma [3l the call 
to REORDER(ii, v) will stop, so the depth of the recursion tree is finite. If the depth is zero, then no 
recursive calls are made. It follows that line 13 is executed, so the lemma follows. 

Assume the lemma to be true when the depth of the recursion tree is less than k, where A; > 0. 
We shall prove that the lemma is true when the depth of the recursion tree is k. Since A; > 0, there 
is at least one recursive call. Thus the for-loops are executed. Let T^efore be the initial topological 
order. By Lemma [H Lemma [21 and the induction hypothesis, the following loop invariants hold: 
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1. T is a valid topological order. 

2. At the start of the execution of line 19, T{u) = T{u') < T(v') = Tfeg/oreC^')- 

3. At the start of the execution of line 19, u' /> v' . 

4. After the execution of line 19, T(u) = TbejoreW)- 

5. The flags are correctly set for the recursive call. 

Note that for the last recursive call REORDER(ii', v') in the for-loops, we have v' = v. Thus by the 
loop invariants, we have T(u) = Tbe/ore(^) after the last recursive call in the for-loops. □ 

Lemma 10: Given a DAG G with a valid topological order and two vertices u and v with u < v, 
when executing a call of REORDER(ii, v) in which both Ai and Aj+i are computed, u will be moved 
right with distance at least tj. 

Proof: Since both Ai and Aj+i are computed, by the algorithm, we have ti < d{u, v) < tj+i and 
= 0. There are two cases to consider. 

Case 1: Ai^i = 0. It follows that A' = {w : u ^ w and w < v} = By Lemma [9l u will be 
moved right to the initial position of v. Since initially d{u, v) > ti, u will be moved right with distance 
at least tj. 

Case 2: Ai^i ^ 0. By the algorithm, the for-loops are executed. Let u be the vertex with lowest 
topological order in ^i+i. Let TiniUai be the initial topological order. Since Ai = 0, we have initially 
d{u,u) > ti, i.e., Tinitiai{u) — Tinitiai{v) > ti. By Lemma [3] and the order in which we make the 
recursive calls, before the last iteration of the outer for-loop, T(v) > TiniUaiiu) will hold. Consider 
the execution of the last iteration of the outer for-loop. Let Tstart be the topological order at the start 
of this iteration. Then we have Tstart{v) - TinHiaiiu) > TiniUaiiu) - Tinitiai{u) > U- By Lemma [3] and 
Lemma [HI the following loop invariants hold. 

1. T is a valid topological order. 

2. At the start of the execution of the last iteration of line 19, T(u) = T{u') < T{v') = TgtartW). 

3. At the start of the execution of line 19, u' /> v' . 

4. At the start of the execution of line 19, {w : u ^ w and u; < f' < = 0. 

5. After the execution of line 19, T{u) = TstartW)- 

6. The flags are correctly set for the recursive call. 

Thus after this iteration, we will have T(u) = Tgtartiy) > Tinitiaiiu) + ti, and the lemma follows. □ 

Theorem 6: While inserting a sequence of edges, there are at most 0( " ^*'^ -) calls of Reorder for 
which both Ai and Aj+i are computed. Similarly, there are at most 0( " ^*'^ -) calls of Reorder for 
which both Bi and -Bi+i are computed. 

Proof: We shall only prove that there are at most 0( " ^*''^ ) calls of Reorder for which both Ai 
and ylj+i are computed. It can be proved in a similar way that there are at most 0( " ^f^ ) calls of 

Reorder for which both Bi and Sj+i are computed. 

Let Ci{u, v),C2{u,v), . . . , C^(„ ,;)(«, v) be the calls to Reorder(m, v) for which both Ai and Aj+i 
are computed for all vertices u and v. Let Si{u,v) = {w : Ci{u,v) leads to a swap of u and w} 
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for i = l,...,m{u,v). We shall prove that Ei^i E«,e5,(«,i>) -^("' ^) = 0{n'^^/U+l). By 

Lemma [T0| J2weSi{uv) ^i''^^'^) — ^* vertices u and v and i = 1, . . . ,m{u,v). It follows that 

l^u,v 2^i=l ^ — ^\ ti >■ 

In a call to Reorder(u, w), Ai and ^j+i are both computed only if ti < d{u,v) < tj+i. Thus by 
the local property (Lemma [3l), we have D{u,w) < ij+i for all w £ Si{u,v), i = 1,... ,m{u,v). By 
Theorem O we have "^uv ^t{u,v) = 0(n^-y/fj+i). Thus it suffices to show that in the summation 

Z^u j; Yl'illi'^^ "l^weSiiu v) -^(^)^)) D^u^w) is counted at most twice for each vertex pair (n, w). 

To show that in the summation Ylu v Yl^i'""^ "l^w&Siiu v) -^(^' D{u, w) is counted at most twice 
for each vertex pair {u,v), we only have to prove that Si{u,v) n Sj{u,v') n Sk{u,v") = if Ci{u,v), 
Cj{u,v'), and Ck{u,v") are three different calls. 

Suppose for the contradiction that w G Si{u,v) n Sj{u,v') n Sk{u,v") and Ci{u,v), Cj{u,v'), 
and Ck{u,v") are three different calls. Without loss of generality, we assume Ci{u,v) occurs be- 
fore Cj{u,v') and Cj{u,v') occurs before Ck{u,v"). By Corollary [U there is only one swap of 
u and w, so Cj{u,v') must be a subsequent recursive call which occurs during the execution of 
Ci{u,v) and Ck{u,v") must be a subsequent recursive call which occurs during the execution of 
Cj{u,v'). Consider the execution of Ci{u,v). By Lemma [3] and the order in which we make the 
recursive calls in the for-loops, Cj{u,v') must occur during the last iteration of the outer for- 
loop. Note that by Lemma El before the last iteration of the outer for-loop begins, all vertices in 
= {w : u ^ w, d{u, w) < tj+i and w < v} = {w : u ^ w and w < v} are moved to the right of 
V. Thus when the last iteration of the outer for-loop begins, there are not any vertices w between u 
and V with u ^ w. Therefore, by the local property of Reorder, during the last iteration of the outer 
for-loop, for each call to REORDER(n, v'), we have {w : u ^ w and w < v' < v} = 9. By Lemma[8l we 
have the first flag /i = 1 for each subsequent call to Reorder(u, •) during the execution of Cj{u,v'). 
It follows that the first flag /i of the call Ck{u,v") is 1. Thus by the algorithm, we don't compute 
in the call Ck{u,v"), which is a contradiction. □ 

5.2 Runtime Analysis 

Lemma 11: While inserting a sequence of edges, the total time spent on executing line 2 of Insert 
is 0(n2). 

Proof: As discussed in Section 13.31 each execution of line 2 of Insert can be done in 0(1) time. 
Since there are at most n(n — l)/2 edge insertions, the lemma follows. □ 

Lemma 12: While inserting a sequence of edges, the total time spent on computing Ai and Bi, 
i = 1, . . . ,p, over all calls of REORDER(ii, t;) with ti < d{u,v) < tj+i is O(n^). 

Proof: As discussed in Section [3^31 it needs 0(|^i| + \Bi\ + 1) time to compute Ai and i3i in a call 
of Reorder('u, f ) if d{u,v) < tj+i. By Theorem [H the summation of + |-Bj|, i = 1, . . . ,p, over all 
calls of Reorder is O(n^). By Corollary [2l Reorder is called O(n^) times. Thus the summation of 
+ + 1) over all calls of Reorder is O(n^), and the lemma follows. □ 

Lemma 13: For each i with 1 < i < p + 1, while inserting a sequence of edges, the total time spent 

^2^3/2 

on computing Ai and Bi, over all calls of REORDER(n, with d[u,v) < ti is 0{^-^ — ). 
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Proof: If d{u,v) < ti, then by the algorithm, Ai is computed only if Ai-i is also computed and is 
empty. Thus by Theorem [6l there are at most 0{ ^. ) such calls. As discussed in Section [3131 it 
needs 0{ti) time to compute Ai in each of such calls. Thus the total time spent on computing Ai over 
all calls of REORDER(n, f) with d{u,v) < ti is 0{—j--^ — ). Similarly, the total time spent on computing 

„2^3/2 

Bi over all calls of Reorder(u, -u) with d{u,v) < U is 0{ ^_ ). □ 

Lemma 14: While inserting a sequence of edges, the total time spent on computing Aq and Bq over 
all calls of Reorder is 0(n^ • to). 

Proof: As discussed in Section [3.31 it needs O(to) time to compute and Bq in a call of Reorder. 
By Corollary [21 Reorder is called 0{v?) times, and the lemma follows. □ 

Lemma 15: While inserting a sequence of edges, the total time spent on swapping vertices is 0{n?"^). 

Proof: As discussed in Section 13. 3[ each swap of vertices u and v with d{u, v) < t can be done in 
d{d{u,v)) time. By Theorem [5l ^ = 0(n^-^). Thus the total time is d{n^-^). □ 

^2^3/2 

Theorem 7: While inserting a sequence of edges, the total time required is 0(X^f=i 't\ — '~ " ^o)" 
Proof: It follows directly from the above lemmas. □ 



6 Further Discussion 

Let vP'^ < to < ti < . . . < tp < tp^i = n and p = O(logn). We have known that the runtime is 

_^2j3/2 

^(X]f=i ~r^7 — I" ■ ^o)- In this section, we show how to determine the values of these parameters. 
By letting 



3 T,2^3/2 2.3/2 „2^3/2 2.3/2 



r? ■ to, 



V^P+l tp-i tp-2 to 

we have 

^5/3 

ti = t^'^, ti = for alH = 2, . . . ,p + 1. 



%-2 



Let ii = ig' for i = 0, . . . ,p + 1. we have 



Xo = 1^X1= 4/3, and Xi = ^'^^ ^ ^— ^ for all i = 2, . . . ,p + 1. 



By solving this linear second order recurrence relation, we get Xi = 2 — (2/3)* for all i = 0, 1, 2, . . . ,p+l. 
It follows that tp+i = ^"^^^^ . Since tp+i = n, we have to = nf^P\ where f{p) = (2.3^+1-2^+1) ■ 

Corollary 3: While inserting a sequence of edges, the total time required is 0(n^+-^(P-*) if to = n^^^^ 
and ti = for alH = 1, . . . ,p + 1, where f{p) = (2.3P+I-2P+I) ■ 
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Note that f{p) = (2.3P+I-2P+I) = 0-5 + (2.3P+I-2P+I) • letting e{p) = (2.3p+i_2p+1) ^ have 
< 3I+T < (3/2) P. By letting p — log3^2^; have 1 < n^^^^ < n^/" < 2 when n > 2. Thus 
0(n^+-^(p)) = 0(n^'^^'^^P'') = 0(n^'^) if we choose p = [log3y2 • The following theorem summarizes 
our discussion. 

Theorem 8: There exists an 0(n^'^ )-time algorithm for online topological ordering. 

7 Concluding Remarks 

We propose an 0(n^'^)-time algorithm for maintaining the topological order of a DAG with n ver- 
tices while inserting m edges. By combining this with the result in we get an upper bound of 
0(min{m^/^, n^'^}) for online topological ordering. The only non-trivial lower bound is due to Rama- 
lingam and Reps [8], who show that any algorithm need r2(n log n) time while inserting n — 1 edges 
in the worst case if all labels are maintained explicitly. Precisely, our algorithm runs in 0{v?'^ log^ n) 
time. Choosing a better implementation for the pails, like data structures discussed in Section 5 of [l], 
can further drop one log factor from the runtime. 
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